A procedure to include the uncertainty on the background estimate in upper limit 
calculations using Poissonian sampling is presented for the case where the 
uncertainty can be assumed to be Gaussian. Under that hypothesis an analytic 
expression for the likelihood is derived, which can be written in terms of polynomials 
defined by recursion. This expression may significantly speed up computing 
applications that extract upper limits using Toy Monte Carlo. 
1 Introduction 

In searches for rare processes where there is no statistical evidence of the signal, 
it is often convenient to combine the results of several independent selection 
channels in order to obtain the upper limit on the number of expected signal 
events. A statistical procedure using a likelihood ratio approach has been 
adopted to combine the results of the Higgs searches performed by the four LEP 
experiments [1, 2]. The likelihood ratio estimator $Q$ is defined as: 

$$Q = \frac{L(s+b)}{L(b)} \tag{1}$$

where $L(s+b)$ and $L(b)$ are the likelihood functions in the hypotheses of 
signal plus background and background only respectively. 
In the case of Poissonian sampling, only the numbers of events passing each of a 
set of independent selection channels are used as observables to discriminate the 
hypothesis of signal plus background from that of background only. The likelihood 
functions $L(s+b)$ and $L(b)$ can then be written as products of Poisson 
probabilities: 

$$L(s+b) = \prod_{i=1}^{n_{ch}} \frac{e^{-(s_i+b_i)}\,(s_i+b_i)^{n_i}}{n_i!} \tag{2}$$

$$L(b) = \prod_{i=1}^{n_{ch}} \frac{e^{-b_i}\,b_i^{n_i}}{n_i!} \tag{3}$$
where $n_{ch}$ is the number of selection channels, $s_i$ and $b_i$ are the expected 
numbers of signal and background events respectively, and $n_i$ is the number of 
selected events in channel $i$. The following simplified expression for $Q$ is more 
convenient for computer computations: 

$$Q = e^{-s_{tot}} \prod_{i=1}^{n_{ch}} \left(1 + \frac{s_i}{b_i}\right)^{n_i} \tag{4}$$

where $s_{tot} = \sum_{i=1}^{n_{ch}} s_i$. 
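As an illustration, the simplified expression for $Q$ can be evaluated in logarithmic form, which avoids the factorials altogether. The following is a minimal sketch, not code from the analysis: the function name `minus_two_log_q` and the example numbers are hypothetical, and every $b_i$ is assumed to be strictly positive.

```python
import math

def minus_two_log_q(s, b, n):
    """Return -2 log(Q) for independent Poisson channels.

    s, b: expected signal and background per channel (b_i > 0 assumed).
    n:    observed event counts per channel.
    """
    if not (len(s) == len(b) == len(n)):
        raise ValueError("s, b and n must have the same length")
    s_tot = sum(s)
    # log Q = -s_tot + sum_i n_i * log(1 + s_i / b_i)
    log_q = -s_tot + sum(n_i * math.log1p(s_i / b_i)
                         for s_i, b_i, n_i in zip(s, b, n))
    return -2.0 * log_q

# Made-up numbers for two channels, for illustration only.
print(minus_two_log_q(s=[1.0, 0.5], b=[2.0, 4.0], n=[3, 4]))
```

Working with $\log Q$ rather than $Q$ keeps the sum over channels numerically stable when many channels are combined.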

In the case where $s$ is the number of produced events and $\varepsilon_i$ are the 
efficiencies of each selection, $s_{tot} = s \sum_{i=1}^{n_{ch}} \varepsilon_i$. The 
absolute minimum of $-2\log(Q)$ as a function of $s$ gives the most likely value of 
$s$, and hence of the branching fraction. If there is no evidence of the signal, it is 
possible to compute an upper limit at a given Confidence Level (C.L., usually 90% 
or 95%) using a Toy Monte Carlo, generating a large number of random experiments 
for different values of the signal $s$. The confidence level for the signal hypothesis 
can be computed as: 

$$\mathrm{C.L.}_s = \frac{\mathrm{C.L.}_{s+b}}{\mathrm{C.L.}_b} = \frac{N_{Q_{s+b} \le Q}}{N_{Q_b \le Q}} \tag{5}$$

where $N_{Q_{s+b} \le Q}$ and $N_{Q_b \le Q}$ are the numbers of generated experiments 
that have a likelihood ratio less than or equal to the measured one, in the 
signal-plus-background and background-only hypotheses respectively. 
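The counting in Eq. (5) can be sketched with a small Toy Monte Carlo. This is only an illustrative sketch under stated assumptions: the names `log_q`, `poisson` and `cl_s` are hypothetical, the Poisson generator uses Knuth's multiplication method (adequate for the small expected counts typical of rare searches), and no background uncertainty is included.

```python
import math
import random

def log_q(s, b, n):
    """log Q from the simplified Poisson expression, Eq. (4)."""
    return -sum(s) + sum(n_i * math.log1p(s_i / b_i)
                         for s_i, b_i, n_i in zip(s, b, n))

def poisson(rng, mu):
    """Draw one Poisson variate (Knuth's method; suitable for small mu)."""
    limit = math.exp(-mu)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def cl_s(s, b, n_obs, n_toys=20000, seed=1):
    """Estimate C.L._s = C.L._{s+b} / C.L._b by counting toy experiments."""
    rng = random.Random(seed)
    q_obs = log_q(s, b, n_obs)
    n_sb = n_b = 0
    for _ in range(n_toys):
        toy_sb = [poisson(rng, s_i + b_i) for s_i, b_i in zip(s, b)]
        toy_b = [poisson(rng, b_i) for b_i in b]
        n_sb += log_q(s, b, toy_sb) <= q_obs  # signal-plus-background toys
        n_b += log_q(s, b, toy_b) <= q_obs    # background-only toys
    return n_sb / n_b if n_b else float("nan")

# Made-up single-channel example, for illustration only.
print(cl_s(s=[3.0], b=[1.0], n_obs=[1]))
```

A 90% C.L. upper limit would then be obtained by scanning $s$ and finding the value at which the estimated confidence level drops to 0.1.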

Background uncertainty can be included in the definition of the likelihood 
by applying a convolution with the distribution of the background, which is given 
by the assumed distribution of the background fluctuation. The case where the 
error on the background estimates $b_i$ can be assumed to be Poissonian has been 
studied in ref. [3]. In other cases, the uncertainty may be better considered 
as Gaussian. This is true for instance when the background estimates are 
computed by subtracting different samples or by applying scaling factors 
affected by Gaussian uncertainties. 
It should be remarked that the method applied in reference [3], which is also 
applied in this paper, does not have as sound a formal basis as was originally 
thought. Reference [4] notes that the method is not correct from a frequentist 
point of view; nonetheless, it "seems to be acceptable to many pragmatic 
frequentists". This caveat should be kept in mind when handling the results 
reported in the following. 



2 Including Gaussian background uncertainties in the upper limit 
extraction 



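Under the assumption that the uncertainties are Gaussian, equations (2) and 
(3) need to be convolved with a Gaussian function, and become: 

$$L(s+b) = \prod_{i=1}^{n_{ch}} \frac{1}{\sqrt{2\pi}\,\sigma_i} \int_{0}^{+\infty} \frac{e^{-(s_i+b'_i)}\,(s_i+b'_i)^{n_i}}{n_i!}\, e^{-(b'_i-b_i)^2/2\sigma_i^2}\, db'_i \tag{6}$$

$$L(b) = \prod_{i=1}^{n_{ch}} \frac{1}{\sqrt{2\pi}\,\sigma_i} \int_{0}^{+\infty} \frac{e^{-b'_i}\,{b'_i}^{n_i}}{n_i!}\, e^{-(b'_i-b_i)^2/2\sigma_i^2}\, db'_i \tag{7}$$
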
where $\sigma_i$ is the error on the estimate of the background $b_i$. The integration 
can be extended from $-\infty$ to $+\infty$, including the unphysical negative 
region, when the area of the tails of the Gaussian distributions in that region 
is negligible. This is true if $\sigma_i$ is sufficiently smaller than $b_i$. In that case, the 
integrals are easily manageable analytically. The integration can be performed 
exploiting the following expression for the exponent of the exponential term: 



$$\frac{(b'_i-b_i)^2}{2\sigma_i^2} + (s_i+b'_i) = \frac{(b'_i-b_i+\sigma_i^2)^2}{2\sigma_i^2} + (s_i+b_i) - \frac{\sigma_i^2}{2} \tag{8}$$
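The integration variable can be normalized to $z = (b'_i - b_i + \sigma_i^2)/\sigma_i$ so that 
the likelihood functions become: 

$$L(s+b) = \prod_{i=1}^{n_{ch}} \frac{1}{n_i!\,\sqrt{2\pi}}\, e^{-(s_i+b_i)}\, e^{\sigma_i^2/2} \int_{-\infty}^{+\infty} e^{-z^2/2}\,(s_i+b_i-\sigma_i^2+\sigma_i z)^{n_i}\, dz \tag{9}$$

$$L(b) = \prod_{i=1}^{n_{ch}} \frac{1}{n_i!\,\sqrt{2\pi}}\, e^{-b_i}\, e^{\sigma_i^2/2} \int_{-\infty}^{+\infty} e^{-z^2/2}\,(b_i-\sigma_i^2+\sigma_i z)^{n_i}\, dz \tag{10}$$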
The integrals present in the likelihood functions are all of the form: 

$$\int_{-\infty}^{+\infty} e^{-z^2/2}\,(\alpha+\beta z)^{n}\, dz \tag{11}$$

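and can be computed by expanding the polynomial and using: 

$$\int_{-\infty}^{+\infty} e^{-z^2/2}\, z^{2n}\, dz = \sqrt{2\pi} \prod_{k=0}^{n-1} (2k+1) \tag{12}$$

$$\int_{-\infty}^{+\infty} e^{-z^2/2}\, z^{2n+1}\, dz = 0 \tag{13}$$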

or can alternatively be computed by defining the $n$-th degree polynomials: 

$$p_n(\alpha,\beta) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} e^{-z^2/2}\,(\alpha+\beta z)^{n}\, dz . \tag{14}$$



It can be easily shown that the polynomials satisfy the recursion relation: 

$$p_n(\alpha,\beta) = \alpha\, p_{n-1}(\alpha,\beta) + (n-1)\,\beta^2\, p_{n-2}(\alpha,\beta) . \tag{15}$$
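For completeness, one way to obtain the recursion is sketched here. It is a worked derivation rather than text from the original argument: one factor $(\alpha+\beta z)$ is split off and the term proportional to $z$ is integrated by parts, with the boundary term vanishing because of the Gaussian factor. It holds for $n \ge 2$.

```latex
\begin{aligned}
p_n(\alpha,\beta)
 &= \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty}
    e^{-z^2/2}\,(\alpha+\beta z)\,(\alpha+\beta z)^{n-1}\,dz \\
 &= \alpha\,p_{n-1}(\alpha,\beta)
  + \frac{\beta}{\sqrt{2\pi}} \int_{-\infty}^{+\infty}
    \left(-\frac{d}{dz}\,e^{-z^2/2}\right)(\alpha+\beta z)^{n-1}\,dz \\
 &= \alpha\,p_{n-1}(\alpha,\beta)
  + \frac{\beta}{\sqrt{2\pi}} \int_{-\infty}^{+\infty}
    e^{-z^2/2}\,(n-1)\,\beta\,(\alpha+\beta z)^{n-2}\,dz \\
 &= \alpha\,p_{n-1}(\alpha,\beta) + (n-1)\,\beta^2\,p_{n-2}(\alpha,\beta)
\end{aligned}
```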

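The first polynomials are: 

$p_0(\alpha,\beta) = 1$ 
$p_1(\alpha,\beta) = \alpha$ 
$p_2(\alpha,\beta) = \alpha^2 + \beta^2$ 
$p_3(\alpha,\beta) = \alpha^3 + 3\alpha\beta^2$ 
$p_4(\alpha,\beta) = \alpha^4 + 6\alpha^2\beta^2 + 3\beta^4$ 
$p_5(\alpha,\beta) = \alpha^5 + 10\alpha^3\beta^2 + 15\alpha\beta^4$ 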
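The recursion lends itself to a short iterative implementation that needs only the two previous values. The sketch below is illustrative: the function name `p` is a hypothetical choice, and plain floating-point arithmetic is assumed to be sufficient for the event counts of interest.

```python
def p(n, alpha, beta):
    """Evaluate p_n(alpha, beta) iteratively from the two-term recursion,
    starting from p_0 = 1 and p_1 = alpha."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return 1.0
    prev, curr = 1.0, alpha  # p_0 and p_1
    for k in range(2, n + 1):
        # p_k = alpha * p_{k-1} + (k - 1) * beta^2 * p_{k-2}
        prev, curr = curr, alpha * curr + (k - 1) * beta**2 * prev
    return curr

print([p(n, 2.0, 0.5) for n in range(4)])  # [1.0, 2.0, 4.25, 9.5]
```

Each evaluation costs a number of multiplications linear in $n$, with no numerical integration.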

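The likelihood functions can be rewritten as: 

$$L(s+b) = \prod_{i=1}^{n_{ch}} \frac{1}{n_i!}\, e^{-(s_i+b_i-\sigma_i^2/2)}\, p_{n_i}(s_i+b_i-\sigma_i^2,\ \sigma_i) \tag{16}$$

$$L(b) = \prod_{i=1}^{n_{ch}} \frac{1}{n_i!}\, e^{-(b_i-\sigma_i^2/2)}\, p_{n_i}(b_i-\sigma_i^2,\ \sigma_i) \tag{17}$$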
The computation of the upper limit with the Toy Monte Carlo can be performed 
with a small modification to the code used for the computation in the 
case of no error. The following simplified expression for $Q$ may be convenient: 

$$Q = \frac{L(s+b)}{L(b)} = e^{-s_{tot}} \prod_{i=1}^{n_{ch}} \frac{p_{n_i}(s_i+b_i-\sigma_i^2,\ \sigma_i)}{p_{n_i}(b_i-\sigma_i^2,\ \sigma_i)} \tag{18}$$
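The modification to the no-error code amounts to replacing each factor $(1 + s_i/b_i)^{n_i}$ with a ratio of polynomials. The following sketch shows one possible form; the names `p` and `log_q_gauss` are hypothetical, and it assumes $\sigma_i$ is small enough compared with $b_i$ that both polynomial values are positive.

```python
import math

def p(n, alpha, beta):
    """p_n(alpha, beta) from the recursion, with p_0 = 1 and p_1 = alpha."""
    if n == 0:
        return 1.0
    prev, curr = 1.0, alpha
    for k in range(2, n + 1):
        prev, curr = curr, alpha * curr + (k - 1) * beta**2 * prev
    return curr

def log_q_gauss(s, b, sigma, n):
    """log Q including Gaussian background uncertainties sigma_i."""
    log_q = -sum(s)
    for s_i, b_i, sig_i, n_i in zip(s, b, sigma, n):
        num = p(n_i, s_i + b_i - sig_i**2, sig_i)
        den = p(n_i, b_i - sig_i**2, sig_i)
        # Assumes num and den are positive (sigma_i well below b_i).
        log_q += math.log(num / den)
    return log_q

# Made-up single-channel example, for illustration only.
print(-2.0 * log_q_gauss(s=[1.0], b=[2.0], sigma=[0.5], n=[3]))
```

Setting every $\sigma_i$ to zero gives $p_n(\alpha, 0) = \alpha^n$, so the expression reduces to the case with no background error, which provides a simple consistency check.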



The use of polynomials instead of other means of numerical integration 
provides a significant speed-up of the code in computer calculations that use 
Toy Monte Carlo. 



3 Application to the search for $B \to \tau\nu$ in BABAR 



The above method has been applied in the search for $B^- \to \tau^- \bar{\nu}$ in the 
BABAR experiment [5], using the events where one B meson is fully reconstructed, 
where the assumptions made in this paper hold. The search has 
shown no evidence for the signal. In order to evaluate the 90% C.L. limit on 
$\mathcal{B}(B^- \to \tau^- \bar{\nu})$, a large number of Toy Monte Carlo experiments have been 
generated for different values of the assumed branching fraction, each corresponding 
to an expected number of produced events $s$. Figs. 1 and 2 show 
$-2\log(Q)$ and the confidence level as a function of $\mathcal{B}(B^- \to \tau^- \bar{\nu})$ with and 
without including the systematic error on the estimated background. The limit 
obtained including the uncertainty is less stringent than the limit obtained 
neglecting this effect. 



4 Conclusion 



A procedure to include Gaussian uncertainty on the background estimate for 
upper limit calculations using Poissonian sampling has been presented. The 
likelihood can be written in terms of polynomials that can be defined recursively. 
This approach makes the computation of confidence levels more 
efficient. The technique described in this paper has been applied in BABAR 
to the search for $B^- \to \tau^- \bar{\nu}$. 
Fig. 1. $-2\log(Q)$ as a function of $\mathcal{B}(B^- \to \tau^- \bar{\nu})$ with (dashed) and without (solid) 
including the systematic error on the estimated background. 
Fig. 2. Confidence level as a function of $\mathcal{B}(B^- \to \tau^- \bar{\nu})$ with (dashed) and without 
(solid) including the systematic error on the estimated background. 